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BANDWIDTH SAVINGS AND QoS IMPROVEMENT FOR WWW SITES 
BY CACHJNQ STATIC AND DYNAMIC CONTENT ON A DISTRIBUTED NETWORK 
OF CACHES 

5 TECHNICAL FIELD OF THE INVENTION 

The invention relates to the storage ami transport of informal ion oo an Internet and* more 
particularly, to caching Wcb-psge content on the Internet. 

CROSS-REFERENCE TO RELATED APPLICATIONS) 
10 This is o continuation-m-port of commonly-owned, copending U.S. Provisional Patent 
Application No. 60/2 1 8,559, filed 1 7 Jury 2000, 

DACKG RQ UND OF THE I NVENTION 

When two or more computers are connected so that they can share resources, this is referred 
15 to as a "network". When two or more networks ore connected together, this is referred to as 
an "interj>et" flower case The "mtemel" (upper cose T) is a a global. intend, which is a 
plurality of inter-connected networks, oil using o common protocol xriown as TCP/IP 
(Tnmsmission Control Protocol/Internet Protocol). 

20 The World Wide Web ("WWW", or simply •Web*) is n set of standards and protocols that 
enable the exchange of information between computers, more pariicutaily hypertext (HTTP) 
servers, on the Internet, tying tbem together into a vast collection of interactive multimedia 
resources. Traffic across the .Internet passes through multiple "backbone" carriers, such as 
UUNct or Digcx. which forward the signals to one another under a *ystem called peering. 

25 

"Quality of Service" (QoS) is a measure of the speed and reliability wlih which Information 
can be transported from a eootent source (typically a server, ot an Internet Service Provider, 
ISP) to a user's computer (upon which is typically running a type of "browser" software t)>at 
is used to view content). "Bandwidlh" is a measurement of tlie volume of information that 
JO can bo transmitted over a network (or a given portion thereof) at a given time. The higher me 
bandwidth, the more data can be transported. Latency is o measure of the time tjiat it takes 
for a packet of data to traverse Die Internet, from source to user. 

Bandwidth can be increased by instuiUng new capital resources, but increasing bandwidth 
35 only induectly addresses latency, and increasing bnndwidth cannot ijccctcratc overloaded or 
slow origin servers. Other solutions are needed to overcome these barriers. One solution, 
caching content, is discussed at length in an article entitled "Network Caching Guide, 
Optimizing Web Content Delivery", by Michael A. Goulde, March 1999, which is available 
online (via the Internet) at die following address: 
40 htlp;/Avwwuu\tomi.cx>m/r^ 

A text version of the article is available online at iho following address: 
Mrp://ww.v.gCK>gle.contf5earch?q^ache:www.^ 

fech_iesotttces/tcch/cachj ngguide.pdf 

« As noted m the £pujdg article: 

"One way to improve performance and make it more predictable is to 
minimize the variability introduced by long trips across the Internet from the 
content source lu the user's brt^vser. By storing frequently accessed content at 
50 . a location closer to the user, a great ileal of bteocy ami unpredictable delay in 
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the Inleroet can be eJiminated. 'fhc technique for doing this is called caching. 
Caching is a means of storing content objects from a Web server closer to the 
user, where they can be retrieved more quickly. The storehouse of objects, 
including text pages, images, end olber content, is called a Web cache 
S (pronounced cash)." (Gouldc, page 2) 

Caching is an upprviich (bat improves Qo3 tor users whUe improving the 
economics for network service providers. Caching entails storing frequently 
accessed Web cuntent closer to users, thereby reducing the number of hops 
10 * thai must be traversed in order retrieve content (hnt resides on a remote she. 
IGiMfepageS) 

In operation, requests tor content originate from a user's browser and arc first 
directed to the caching server. If the content is currently contained in the cache 
15 and is up to date, the content is sent to the user without the browser 's request 

having to be sent on to the originating server. This can reduce the latency for 
the transfer of content and reduce the amount of time it takes to transfer the 
content to the user. 11 also reduce the amount of data the service provider has 
to retrieve upstream from the Internet to fulfill requests, f Gouldc page 8) 

20 

Caching is a concept that is well understood in computer hardware design. 
Modem microprocessors employ on-chip caches of high-speed memory lo 
store frequently used instructions and data locally instead of having tn read 
them from slower n>emory. Compute;* implement several additional levels of 

25 caching, including RAM cache and even on-disk cache, all designed to reduce 

the latency for reading instructions and data and speeding op Ihe transfer of 
dala within the system. The basic principle behind caching is to store 
frequently accessed data in a location that can be accessed more quickly tban 
it could from the data's more permanent location. In the case of a system's 

30 CPU ; the permanent location is tbc disk. On the Web, the permanent location 

is the origin server somewhere mil on the Internet. (Gouldc . page 8-9) 

Network caching, while a newer concept, is based on exactly the same 
principle and has lb* same benefit of cost-effcctivcJy improving the user 
35 experience. (Ooulde, page 3) 

"Caching provides direct benefits to the end user in terms of reduced latency 
of Web puge download (the lime you have to wait before anything starts to 
happen) and faster download speeds (the time rl toVes tor all the downloading 

4t) to finish). Bui caching also provides benefits to network service providers. By 

storing, or caching, Web content that users request from remote servers 
locally, fewer of these requests have to be sent out over the Internet to he 
fulfilled. As a result, the access pruvider maintaining a cache has its upstream 
bandwidth requirements reduced. This reduces (ho bandwidth the servjee 

45 provider lias to purchase in order to provide its customers with a satisfactory 

Web experience. (Gouldc. page 3) 

"Caching also provides several benefits in a Web hosting environment in 
which an access provuler hosts Web sites for many customers. If can reduce 
so the impact of traffic spikes caused by high-interest content and also serve as 
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the basis for a variety of value-added services, such as providing additional 
capacity, guaranteed service levels, and replication services. (go ijld c , page 3) 

"[Caching] network data makes it possible lo provide users with an optimal 
experience and predivijblc response tiroes. With typical cache hit rates, the 
user experience has u higher quality of service (QoS). Improved QoS provides 
significant benefits to ISPs, content providers, and corporate site*. Better QoS 
results in higher customer loyalty and retention, ll helps create a stronger 
brand equity, both for the access provider nixl fox content providers. Content 
providers who ensure (Kit their content is amply cached throughout (he 
network , close lo users, will ultimately see more visitors accessing their 
content. (Coulde. page 5) 

"Web browsers also implement n form of caching, storing recently accessed 
Web content on the user's Iturd disk and reading from that local cache of files 
instead of occessing the Internet. This works wcD when the user hits the 
'Back" ond "Forward" buttons in their browser during a session, hut it docs 
nothing if the user is browsing a site for the first time. (Gouldc, page 9) 

"Neither the browser's cache nor a "Web server's cache can address network 
performance issues. By placing a cache of Web content on the network 
between the user and ibe originating Web sites, the distance that corooiordy 
accessed content has to travel over the Internet is reduced, and users 
experience quicker response and faster performance. Network caching takes 
advantage of the fact that some content is accessed more frequently Chat oilier 
content. By implementing o caching solution, network service providers can 
provide a better Web experience to their customers while reducing the total 
requirement for high- bandwidth connections to the Internet. (Cooldc. page 9) 

•When a caching solution Is not in place, requests for content from a browser 
and the content delivered from the origro server must repeatedly traverse a 
complete trip from the requesting computer to the computer ihat has the 
content. The Web browser scuds a request for a Uniform Resource Locator 
(URL) that refcre to a specific Web page on a particular server on the Internet 
The request is routed to the server through the normal TCP/IP network 
transport. The content requested fjoro the Web server {also known as an HTTP 
server) may be a static HTML page with links to one or more additional Jiles, 
including graphics. The content may also be a dynamically created page that is 
generated from a search engine, a database query, or a Web application. The 
HTTP server returns the requested content to the Web browser one Gle at a 
time. Even q dynamically crested page ofleu has static components that are 
combined with the dynamic content to create the final page. fGoulJc. page 
13) 

"When caching is used, frequently accessed content is stored close to the user. 
11 may be stored inside a firewall ou the user's corporate LAN, at the user's 
ISP, or at some other Network Access Point (NAP) or Point of Presence (POP) 
located closer lo the user than the majority of Web servers. If the requested 
content, or file, is located in (be cache and is current, or fresh, that is 
considered a cache hit. The more frequently user requests can be served from 



(34) 



JP 2004-504681 A 2004.2.12 



WO0MJ7J64 PC-I7IUH/D005I 
4 

these caches, the higher the hit rate, Uw better the user 'a performance, and the 
less duta iht service provider has to retrieve Tmin Uw Internet on behalf of the 
user. (Gouldc. page 1 4) 

j "Similar issues exist for FIT tile transfers, with an FTP server handling each 

request for u fite submitted by the KTP client application. Delays and 
bottlenecks can be u bigger problem with FTP because the Epical size of a iiJc 
transferred by FTP is larger then a typical HTML fife. Streaming audio and 
video are additional examples of an Internet application thai can benefit from 
10 ci^bijig content close to cad user*. Ifloolde . page 14) 

"Network caching can be applied to 'content delivered over many different 
protocols. Ifcese include HTTP, NNTP, KIP, RiSP, and others. All are 
diaracterized by having some proportion of static content and high utilizatioa . 
15 Cache server support for each protocol is, of course. iequircd." (page 14) 

"Since the cncltc must be kept fresh, there will still be traffic from the ISP out 
to the Interact, even if every bit of content requested by users is in o cache 
Page freshness has to be assured, and new content must be downloaded But 

20 by using caching, bandwidth utilization can be optimized. It is even possible to 

u$e a cache with dynamic content, sinoe even these pages have some static 
content that can be served from a cache. Depending on the distribution of 
traffic ami the scalability of the cache, up to 40 percent of user HTTP requests 
can be taken off the network ami served from a cache. This makes networks 

35 more efficient, enabling better paformAnce to be offered at lower cost." 

The Gould article discusses the effective use of caches, load balancing, where to locate 
caches in the infrastructure, and designing cache- friendly web content. There is also 
30 mention of protocols which have been developed - for example, Web Cache Control Protocol 
(WCCP) (page 1 8). There is also discussion of appropriate use of the "expire" and 'max- 
age" headers in the HTTP protocol. (Goujdc, page 27). And, as expressly stated by Gould, 

"With good design even dynamically generated pages can benefit from 
3 j caching. By separating the dynamic content of n page from the static content. 

the static content con be cached and the dynamic content retrieved and 
downloaded separately. (Gouldc. page 28; emphasis supplied) 

It is therefore known to cache static data from Web sites using a dedicated computer's RAM 
•io and hard disk, so that this computer can act as a proxy between the WWW ("web") servers 
and ihcir users. Several requests Tor a given Web page can then be served si the cost of a 
smgje request to the original Web server. Internet Service Providers (ISPs) commonly 
provide such a caching scrvico to their customers. 1 fowever, this technique suffers from two 
main drawbacks: 

J5 - it is nut applied globally, and even users of the Web sites who have access to o cache 

have tn deliberately activate this feature, nf which they are oflen not aware; and 

- it docs not apply to pages which encage very often, as is the case with dynamic 
content (content which is generated on-thc-fly by the remote server). 

50 There therefore exists a now! for the caching of dynamic content, as well as static content. 
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GIoTsnry/Deflni/loni 

Unless ouierwise noted, or as may be evident from I he context of their usage, any terras, 
abbreviations, acrooyms or scientific symbols anil nutations used herein ore to be given their 
5 ordinary meaning in the technical discipline to which the invention most nearly pertuiui. The 
following glossaiy of terms Is intended to lend empty and consistency to the various descriptions 
contained herein, as well as in my prior art documents which may be cited: 

10 Cache Server A highly opt imiztd oppKcatioo that sturw Irequentty accessed content 

at strategic aggregation points dose to the users requesting ihat content 
in order to reduce the impact of delays and network bottlenecks. 

CARP Cache Array Kouting Protocol. A protocol for synchronizing 

15 multiple cache servers, CARP maintains a shared namespace that maps 

any Web object's address (URL) to only one node in Ihe array. 
Requests ore routed to that node. 

Cookie The most common meaning of "Cookie** on the Internet refers to a 

2ti piece of information sent by a Web Server to a Web Browser that the 

Browser software is expected to save and to send back to the Server 
whenever the browser makes Additional requests from the Server. 
Depending on (he type of Cookie used, and the Browser's settings, the 
Browser may nccepi or not accept the Cookie, and may save the 
25 Cookie for cither a short time or a long rune. Cookies might contain 

inforroaiion such as login or registration information, online "shopping 
carl" information, user preferences, etc.. When a Saver receives u 
request from a Browser that includes a Cookie, the Server is able to use 
the information stored io Ibe Cookie. For example, the Server might 
30 customize what is sent back 10 the user, or keep 3 log of particular 

user** requests. Cookie* arc usually set to expire after a proleicnnincd 
amount of time and aje usually saved in memory until the Browser 
software is closed down, at which time (hey may be saved to disk if 
their "expire time" has not been reached. 

35 

Dynamic Content "Live" content which is updated on u regular basis. Examples of 
dynamic content might include a "eunenl tempe^aiare ,, display on a 
weather web site, search results, or n "Ctinent Top Headlines" item on 
a jkws web silc 

10 

HTTP Server A server that implements Ihe HTTP protocol, enabling it to serve Web 

pages to client agents (browsers). HTTP Servers supporl interfaces so 
that Web pages can caJl external programs. They also support 
encryption mechanisms for securely exchanging information and 
t> authentication and access control mechanisms lo control access to 

content 

ICP Internet Cache Protocol. A protocol for syncrironmng multiple cache 

servers. Koch time a cache server experiences a miss, it broadcasts 
50 messages to all peer nodes asking whether each has the content. The' 
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. requesting server then must issue a request for the content and forward 
it oo to the user. 

Proxy Server A proxy server octs as an inlcrmediary between a user .met Ibe Internet 

so on enterprise eon ensure security njid administrative control and nlso 
provide a caching service. A proxy server is usually associated with or 
part of a gateway server that separates the enterprise network froui the 
outside network, or a firewall that protects (he enlorprise network from 
outside intrusion. 

Routers Routers are the devices that build a fully interconnected network out of 

a collection oTpoinl-Uvpoiiit links. Routers un the Internet exchange 
biformation pertaining lo their local section of the network, particularly 
how close they are topologicaily to local systems. They collectively 
build a map of how lo gel from any point in the internet to any other. 
Packers ere routed based oo the exchanged mapping intbnration, until 
the lusi router connects directly lo the target system. 

Static Conteni "Fixed" or long-term unchanging components of web pages stored as . 

fijes that are citJwr never changed or arc changed onlv on an mfreqtient 
basis. 

Switches • High-speed network devices tbul typically sit on the periphery of the 

Internet. Switches djjft'or from roulers in providing higher performance 
at a lower price bitt with limited functionality. Typical switches can 
route traffic locally but aren't concerned with complexities of routing 
found in the high-speed Internet backbone. Switches play an important 
roje io caching because Ibey arc often tised lo divert the cavhetbtc 
traffic to the caching system. 

Hypertext Markup language. A specification based on Standard 
Generalized Markup Unguoge (SGML) for tagging text so thai it may 
be displayed in n user agent (browser) in a standard way. 

Hypertext Transmission Protocol. An application-level protocol thut 
runs on .top of TCP/IP, which is the foundation for the World Wide 
Web. 



HTML 



33 HTTP 



IP Address 



Internet Protocol. Tbe network layer for the TCP/IP protocol suite. II is 
a connectionless, best-effort, packet-switching protocol. 

A 32-bil address defined by the Internet Protocol that is usually 
rcpescnted in decimal notation. It uniquely identifies cacb computer 
on the Internet. 



Protocol An agreed-upon set of tcchnicnl rules by which computers exchange 

information. 

UK I . Uniform Resource Locator. The method by which Jnteroct sites are 

addressed. It includes an access protocol and either an TP address or 
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DNS name. An example is hHp:/fwww,ik>nJuin.cwru 

Uscnd Short far User's Network. A collection often* of thousands of bill Irtin 

boards residing on the Internet. Each contains discussion groups wJJcd 
newsgroups dedicated to various lopivs. Messages are posted ami 
responded to over ihe Network News Transfer Protocol (NNTP). 

Web Server Sec HTTP Server. 
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BRIEF DESCRIPTION OK THE INVENTION 

An object of the invention 1o provide a technique foi reducing bandwidth usage ofWW 
servers and improving the QoS of WWW vilcs. 

5 

According to the invenrion, a tecluiiq.iK: for caching objects having dynamic content on the 
Internet generally comprises disposing a cache in the Internet for storing und updating copies 
Of dynamic content. The cache may be disposed at a location selected from the group 
consisting of major Internet switching locations, dial-in aggregation points, and corporate 
10 gateways. The cache may also store and update copies of static content. 

According to a feature of the invention, update characteristics of the objects are determined, 
anil a time to live flTL) parameter for the objects is adjusted based upon the update 
characteristics. Generally, the object is updated if its T7L is less than its n<x. The TTT, for 
15 an object may be adjusted to: 

(i) maintain its probability of error below a predetermined error probability threshold; 
(ji) maintain its error rate below a predetermined error probability threshold; or 
(in) maintain its delay time below » predetermined delay threshold. 

20 According to the invention, a method of responding to a user request far an object having 
dynamic content, said object originating from a server, comprises storing a copy of the object 
in 0 cache; establishing a time to live (TTL) for the object; receiving the user request at the 
cache; 

fulfilling the user request with the stored copy of the object if its TTL is greater than it* age; 
25 and fetching an updated copy of the object and responding to ibe user request with the 
updated copy if the TTL of (In stored copy is less than its age. 

According to a feature of the invention, the TTT.. for the object is first set to o reasonable 
lower limit (Troin) and is than adjusted based on the frequency ul which the object actually 
30 changes. 

According to a feature of the invention, each time the cache fetches like object from the 
server, (he cache performs tho following procedures: 

a. if another fetch for the same object is ongoing, waiting far the previous fetch to 
35 complete; 

b. felchin- the object from the server, 

c. replacing the cached copy, if present, by the fetched object, after having compared 
them to determine whether the object had changed since it was last fetclicd; 

d. initializing or updating the object's change statistics accordingly; 

40 e. marking the object ax static or dynamic coolcut depending on the server's reply; and 

f. it" the object is dynamic, setting it* TTL (7) to an appropriate value with respect to 
an overage, time between changes (t) determined from the object's' change statistics, the 
numbeT of user requests per time unit (rtf determined from the objects access statistics, and 
one of the following procedures (A-E): 

A. maximum error probability; 

B. maximum error rntc; 

C. maximum delay; 

P. any combination of the above (A.B.C). taking the lowest if; or 
E. any combination of the above (A.B.C). but keeping T within a predetermined 
50 window of •rcawnable - values bounded by fain and Tmax. 
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Preferably, die caches arc dedicated machines end ore placed so thai Web browsing passes 
through Ihe cache instead of going a)) the way lo the original silcs, in many different 
locations, ideally within (ho nerworlc of ISPs providing tbe Internet cimaccliviry to Ihe highest 
3 number of users tn those locations. In this manner: 

• the users of those ISPs and, to a fcasw extent* neighboring ISPs, will orjuy a huge 
QoS and speed improvement, for most of the traffic wil) stay within or close to the ISPs* 
internal networks and not need to go through the highly-loaded Internet backbone; and 

- the original web-sites will r.n longer need as much bandwidth, since the caches will 
1 0 absorb most of the load. 

Since marry web-sites tend to serve all- dynamic content, dynamic content wjJI be cached, «s 
well ax static content. However, it is not merely a matter of the caches remembering, tbe 
content, as there is no indication as to when the latter is going to change. Hence, it needs to 
15 be reloaded, periodically, instead of simply serving the cached copy. 

Iherefbre, the cache reloads a page (fetches an object from the server), whenever its 
corresponding cached copy has not been rcfrcslied for a given time (time to live, TIL"). 
TTL can be thought of as the "shelf life" of the page. The cache first, sets ihe TTL for a 

20 dynamic object to a reasonable lower limit T«b. Then, over time, as the cache reloads the 
page several times, it can track the jfrequency at which the page actually changes content, and 
adjust the TI L for that page accordingly* This "TTL" tedinique mimics a common caching 
method for static pages, if the original server of the page specifies a TTL. But since server* 
for dynamic pages do not specify a TTL for them, tbe cache has to establish a reasonable 

25 TTL of its own accord. 

Therefore: 

- whole web- sites can be cached, including (heir dynamic content; 

- traffic betwetn caches and the original sites is better managed; 

JO - while some users might get an outdated content, in the ease of a page changing 

before the cache reloads it, it is possible to limit the delay between the two (which could 
happen anyway witliout the cache, when the network gets saturated) as well as the error 
probability; 

- the system can adapt, in real time, according to the number of requests to each page 
35 and the uc Ural update frequency of the page. 

Other objects, features and advantages of the invention will become apparent in light of ihe 
fulloM'mg description thereof. 
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BKIEFDESCR IPT10 N OF THE DRAWINGS 

Reference will be tninJe in detail to preferred cnibodinieols of the invention, examples of which 
may be illusJmted in Ibc Mtcompunymg drawing figurea The figures iire intended to be 
illustrative, not limiting. Although Hie invention is generally described in Ibc context of these 
5 preferred embodinvents, it JkxiW be understood thai it « ntrt uifaidcd lo limit Ibc spirit rod 
scope of the invent if in to these particular embodiments 

In flowcharts presented herein, rectangular boxes generally represent a sequential step being 
performed, a generally recUuigiuar box with pointed ends reprweolj u decision step (test) 
to having two mnhwlly-exchojve results ("Y^Yer, "N'MJo), »rd an empty cwde is not a step or 
a test, but is merely n graphical junction point at which two nr more paths in Uie flowchart 
converge. 

The structure, operation, and advantage? of the pjesem preferred embodiment of the invention 
1 5 will become further apparent upon ccnsrideiation of the following dewriution, taken in 
conjunction ■with (be accompanying figures, wherein: 

fig rati 1 is a greatly simplified schematic illustration of Oie Internet, illustrating an 
embodiment of Hit caching system of the present invent ion; 

20 

JFIgnre 2 is a flowchart illustrating how user requests for static and/or dynamic content are 
handled by the cache, according to the invention; 

Figure 3 is 8 graph illustrating an average errnr pcohahilify, according to an analysis of the 
25 invention; and 

figure 4 is a graph illustrating the evolution of error probability, according to an analysis of 
the invention. 
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PETAUJ-D DESCRIPTION OF TOR INVENTION 

Figure 1 is a greatly simplified schematic illustration or the Internet environment, illustrating 
.an embodiment of the caching system of the present invention. Generally, a user 102 makes 

5 h "request" Tor an "object" (e.g., f> Web page) which is mink available on Ihc Internet by a 
server (e.g^ ISP) .104. (The object typically originates at a conlent provider, not shown.) A 
switch 106 interfaces a cache (cache server) 108 to the Internet. The cache may contain a 
copy of the Web page. There are two possible ''responses" to the user request - either lie 
server "serves" (or "services'*) the request, or it is "fulfil Jed" in the cache. In the totter ease, 

10 the content must first have been "transferred" to fbo cache, which may periodically ■fetch" (or 
"reload") updated Web page conttDl from the server These terms will be adhered io, to the 
extent possible,. in the discussion thai follows. 

As discussed in the Gouhte article (eg, page 18, Illustration 4), the switch 106 mny be a 
JS. bigli-pcrformance processor that can look at network traffic aod ma ke rou ting decisions based 
on protocols above the IP level. As a result, the switch can direct HTTP (and oiher) traffic to 
caches (108), and send the rest of the traffic directly to the Internet. Tins is exemplary of 
one of a number of possible system architectures. Which architecture is used depends ou 
several factors, including where the cache is implemented, the primary purpose of the cache, 
2D end the nature of the traffic 

As further discussed in the GouJde article: 

'Globes «ui either be deployed in a transparent or oontrnnsparem form. A 
25 nonlransp.uent cache is explicitly visible, und browsers or other caches that 

use the cache are overtly configured to direct traffic to the cache. In this case, 
lac cache acu as a poxy «gcnt for (he browser, fulfilling requests when 
possible and forwarding requests to tbe origin server wljen necessary. 
N outran spa rent caches ore often a component of a larger proxy server octine 
30 as pajt of a gateway or firewall and addressing many different applications 

(fiouMc, pages 1617) 

A transparent cache shs in the network flow end functions invisibly to a 
browser. For ISP and enterprise backbone operations, a transparent 
35 configuration is prefored because it minimizes the total administrative and 

support burden of supporting users in configuring their browsers to find the 
cache. tCiouldc. pace 17) 

Caches should be implemented transparently to maximize the benefits of 
40 caching. A nontninaparcni implementation requires having browsers manually 

configured to direct their requests for coment to the cache server. In a 
transparent configuration, cache benefits arc delivered to clients without 
having to reconfigure the browser. Users automatically gain the benefits of 
caching. (Goulde . pngg J 7) 

45 

In an enterprise environment, transparency can be implemented cither through 
automatic browser configuration or by intercepting traffic on the network. 
Both Netscape and Microsoft provide ntilities for centrally configuring huge 
networks of browsers and for customizing browser configurations being 
SO installed on users' PCs. Browser plug-ins can also provide automatic 



(42) 



JP 2004-504681 A 2004.2.12 



prr/iioii/no65i 



con figuration. Although Ihis approach is transparent to the oscr, il does 
require administrative effort on an ongoing basis. (GpAdde, pnge 17) 

There arc several uptions for providing transparent caching: 

- The cache can be configured as if il were a rouicr so lhal all lntrmet-buscd 
troflw is oiincd at it. This is a transparent configuration lhal requires no 
con figuration of the browser; the browser or doAvnstream cache is unaware of 
the cache's existence but still benefits from il. Tlie downside is that the system 
on which U« cache resides has to devole some of its resources 10 routing, and 
the cache become? a mission-critical part of the network. .Sophisticated router 
configuration with policy-based routing can minimize some of these issues by 
only directing HTTP (TCP Port 80) iraffic to the cache, bypassing the cache in 
the event of failure and sending traffic directly to iho Internet. ( Goultlc page 
17) 

- An increasingly popular option is to rise a Layer 4 switch to interface the 
cache (o the Internet (sec Illustration 4). Jhese switches, currently offered by 
Alteon, Foundry, ArrowPoinl. und others, are high-performance processors 
lhai can look at net work traffic and make routing decisions based on protocols 
above the JP JeveL As a result, the switch can direct HTTP (and other) traffic 
to the caches and send the rest of the traffic directly lo the tuternet ... f l'Jhe 
switch can parse the HTTP request and send the request to a specific node in a 
cache form based on the URL requested. Using an intelligent switch keeps 
luwcessary network traffic off the cache, simplifies designing for arvailah'tlity, 
and distributes loading on the cache farm based on specific URL?. (Goul d c, 
page 18) (An architecture similar to this one is described hereinabove with 
respect to Figure 1) 

- Another option for transparency is the Web Cache Control Protocol 
(WCCP). WCCP was developed by Cisco Systems to allow Web caches Lo be 
transparently installed in networks using Cisco lOS-tmsed routers. With 
WCCP, HTTP traffic Is redirected to the Web cache instead of the origin 
server. WCCP does not require any changes lo the network architecture, 
thereby simplifying Ibc iroplementutiuo of transparent Web caching. (Oouldc. 

"page IB) 

- Web Proxy Autodisco\'cry Protocol (WPAD) is a new proposed standiud 
protocol, which, when integrated with browsers, streaming media clients, und 
other Internet client software, is designed to automatically locate caches and 
services on the network without requiring any configuration by end users. 
WPAD provide* a flexible, vendoi-ncnhBj soflwure alternntive to existing 
cache transparency solutions diat utilize routing or switching equipment In 
the fpiurc, WPAD-cnaWcd client software will automatically connect users 
wiUi embedded network services in their region, providing simplicity for botb 
users nnd the network providers that tleploy these service*. (Goulds , pages 
HM9) 

CucJung systems can be used to optimize the performance of a Web server site 
as well as to speed Inremel access for Web browser users. In a reverse proxy 
configuration, the caching system ails in iron I of one or more Web servers, 
intercepting iraffic to those servers and standing in. or proxying, for one or 
more of tlic servers. Cache servers can he deployed ihroughoiit n network, 
creating a distributed network for hosted content. When necessary the proxy 
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cache server will request dynamic end other short-lived content from the 
origin servos. This enables content from the Rite to be served from a local 
cache instead of from Uie origin server. The proxy server c«n be optimized for 
high performance, efficient operation, conserving resources, rod off-loading 
5 the origin server from serving static contejn. Reverse proxy caching provides 

benefits to the access provider as well as to the user. Those benefits inch»de 
the ability to enable load balancing provide peak-demand insurance to assure 
availability, and provide dynamic mirroring of content for high availability. 
(poolde. page 20) 

?0 

There ore three genera) characteristics that describe where caches are best 
located on a network: 

- Choke Points in the Network Traffic There we locations where a large 
majority of network traffic posses and is therefore visible to the cache server. 

15 ThU allow the enche to handle more requests and store more content than if 

located somewhere that can be easily bypassed. 

- Points with High-Network Load. High traffic conditions nllow higher 
cache utiliziiliun and therefore greater benefits can be achieved. 

- Locations that Produce Greatest Economic Benefits for a Giche. Points 
20 where users will benefit from high cache hit rates while also rerhicing 

upstream bandwidth requirements- will provide both QoS benefits and 
economics tor the access provider. locations with these cliaractcristics are 
typically found at major Internet .switching locations, dial-m aggregation 
points, or corporate gateways. ( fipuldc, page 20) 

25 

Locations with these characteristics are typically found at major Internet 
switching locations, dial- in aggregation points, or corporate gateways, 
including: 

POP and DIAL-UP (sec Oouid c. page 21, Illustration 7) 
30 NAPS (see Gouldc. page 22. Illustration 8) 

Satellite (sec Gouldc. page 22) 

International Gateway (see Gouldc page 23, Illustration 9) 
Web hosting (see Goulde. page 24, Illustration 1 0) 
Last Mile (see Gouldc page 25, Illustration 1 1) 
3 5 Coiporate Gateways (sec Goulde, page 25. Illustration 1 2) 

A person having ordinary skill in the art will readily understand where to locate the cache(s) 
of the present invention in tight of the description set forth hereinabove. Generally speaking, 
the cache of the present invention can 1* located anywhere that there is (or could be) a cache 
AO serving static content, or it con be incorponited into an existing cache which tulfiUs requests 
for static content, with the odditional functionality enabled according to the techniques set 
forth below. Or, it con be provided as a separate, dedicated machine (computer). 

Figure 2 is a flowchart illustrating how user requests for static and/or dynamic content are 
<15 handled by the cache. 

In a first step 202, for a user request for an object, the cache determines whether the 
requested object is in cache. If not (N), the user request is passed on to the server for 
servicing tlie request and meanwhile, in a step 204, the cache fetches the object from the 
5» server in anticipation of the next request for the object from the same or another mser. 
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Generally, in this example, all requests Tor objects are presumed ro go through a cache server, 
which is transparent to the me. II intercepts information requests and decides whether it will 
provide n response from a cached local copy or from o remote ^formation source. After 
fctchiog information from a local source, the cocho servrr decides whether lo store it locally, 
and if so, for how long. A request for inform a ticn which can be provided from a local copy 
is Known as a "cache hit". Conversely, a request for information which is not stored locally is 
known a "cache mis3". When Ihe storage 1 determination algorithm is well-designed the 
probability of a cache hit is greatly improved, and apparent response time to wser requests 
Related to QoS) in reduced Fanher, every Information request satisfied by locally cached 
consent (cache hit) reduces traffic on the external network, pcrrnilting shorter response times 
over the external network. 

]f the requested object is in the cache, it is oexl determined in a tfep 206 whether the 
requested object is marked as static. If so (Y), it is Ibcn determined in a step 208 whether lo 
updaie i he cached copy or lo use it to fulfill the user request using any siu table standard 
algorithm for caching sialic objects, such as comparing the objects *>gc" (the time elapsed 
since il has last been refreshed) to the TTL (If the original server of the page specifies a 
TTL), asking Die server the latest modification tune, etc). 

If the requested object is in cache, end il xs dynamic (N, step 206X il is detennined in o step 
210 whether ihe cached copy's TTL ("shelf life") is less than its age. 

If the cached copy's TTL is less fhon (a lower number than) its age (Y, step 21 OX it is 
considered to be "stale", and in a step 2 12 the cache: 

- updates the object's access statistics (number of user requests per time 
period - last few minutes, last hour, etc.-); and 

•- fetches the object from the original server. 

If Ihe cached copy's TIT. is equal to or greater than its age (N, step 2 1 0), it is considered to be 
"fresh", and in a step 21 4 Ihe cache: 

fulfills Ihe request using the caclicd copy; und 

updates the object's access statistics. 

Uptionalry, if the time difference between the cached copy's egc and its TTL is less than a 
given time, and Ihe number of recent user requests is more than a given rate, it is considered 
to be "aged" wkI "popular", and the cache fetches the object from the server in what is termed 
an "anticipated refresh". 

Addirionally, for each time the cache felchcs an object from the server (see steps 204 and 
212), the following procedures are performed by the cache, in a step 216: 

a. if" another fetch for th* same object is ongoing (eg, due to n previous user request), 
Ihe cache waits Tor the previous fetch to complete, rather than duplicating it request; 
- b. fetches the object from the server, 

c. replaces its cached copy, if present, by the up-to-date object, after having compared 
ihe/n to delermine whether the object had cljauged since it was last fetched; 

d. initialize or update the object's. change statistics (number of changes per time 
period) accordingly, 

e. mark the object as static or dynamic content depending on the original server's 
reply (presence of o modification lime, or of a nonzero TTL, etc.); • 
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f. if the object is dynamic set its TTL (7) to the appropriate value with respect to the 
average time between changes t (dttemuncd from the object's change statistics), the mrober 
of user requests per time unit r| {determined from the objects access statistics), and a selected 
urn; of ihe following pjocechues (A-E): 

A> maximum error probability ( p*\ which is the average ratio of the number of 
requests fulfilled using a cached copy whose corresponding original object has changed for 
more than a given rime tr, over Ibc total number of requests 



r=(jr-fr^Xn. i 



£. rnaxirmim error role ( «o ), which is the overage number on time unit of requests 
fulfilled using a cached copy whose corresponding original object has changed lor more than 
u given time \Y'. 
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& maximum delay ( D 9 ), which is the average time between an object change and 
when the cached copy is refreshed: 



0_ any combination of the above (AJJ.C), taking the lowest T, or 

30 

E, any combination of the above (A,B,C), but keeping T within a predetermined 
window of "reasonable" values bounded by jfiram and Tmax. 

In the above equations, the following prtrametcrs have been used: 
35 T is Time To 1 Jve for the dynamic object; 

W is a given time since the original object Iras changed (i.e., how long it is outdated); 
' x is an average time between changes, which is determined from the object's change 
statistics; 

n Is number of user requests per time unit (e.g., frequency); 
40 po is maximum error probability, which is the average ratio of the number of requests 
fulfilled using a cached copy whose corresponding original object lias changed for 
more than the given rime W, over Ihc total number of requests; 
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r|g is maximum error rate, which is the overage number per lime unit of requests fultilfed 
oring a cached copy whose" corresponding original object has changed for more than 
Ok given lime W\ 

Do is maximum dclny, which is (he overage lime between an object chnnye «nd when the 
5 cached copy is refrcsJied. 

Sefeci/ng and Adjusting TTX 
A simple mode), which takes no other parameter than the average ihnc between change* <x) 
of the object is the exponential decrease model - that is, the probability thot the object docs 
i o not change during a time period of length t is considered 10 be of the fnnn e *' \ It is, of 
course, 1 for r - 0, anil toxis to 0 as / tends to + (eo) 

Considering a situation with many user requests, as compared to the tefresh rate of the cached 
copy, so the cache will update at regular intervals, despite being user-driven. 

15 

Average Greoucncv of chances 

The model's consistency is oow checked especially wbclher the average update frequency of 
o given object is 1/ x. Lei p' r 0) be the probability that k changes over a time period of r. 
By construction: 

20 

And. for A> 0, p\ = {/^GOO-e^V^ - [pUW^*™* 



(the probability Ihot k changes occur over [0:1) is the sum over « of the 
25 probability that * - 1 changes occurred between 0 and «, that there was a 

cfiarige between u and rr+iln, ami none between mdu and f). 



One can deduce by recurrence that: 

30 




35 w 
„_J 1 

*Ir^ <U1 




45 
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Now the overage number of changes over n lime period of i is (by definition of overagmg): 



The error probability is defined as being the percentage of cases where the user receives the 
cached copy whereas the object has changed on the originn] server, ;ind the cached copy is 
to outdated by more than a given time W. Figure 3 illustrates the percentage error (vertical 
axis, in pcrccatagcs) versus lime (horizontal axis, io seconds) for instance, with W-15s, t - 
60s, end T~ 30s. lie graph illustrates tho probability that the cached copy is "stole" by owe 
than W seconds. In the graph, Wis 15 seconds, so the probability is zero between 1 and 15 
seconds. Then, the probubilily rises until 30 seconds, when u rc-fctch occurs [T- 30s), as 
I* illustrated buy tbc sawtooth pattern 302. Tbb pattern repeat? itself every 30 seconds. 
Between 30 nnd 45 seconds the probability is again zero, and a re-tctch occurs at 45 seconds, 
and the probability rises until 60 seconds, as illustrated by the sawtooth pattern 304. A 
similar result is shown by the sawtooth pattern 306 between 75 and 90 seconds. 

20 Given that the cache is supposed to update the copy of the object el regular intervals, the 
calculations can be jicrfbmicd over a time interval [0;TJ. 

The average time £ within the interval during which the content is outdated L» the sum over 
alj intervals fa; uHl„] of the probability that there waa no change between 0 and u, but that 
23 there was a change between u and a hi«, multiplied by the length of time during which the 
content remains outdated hy more than W. That is: 




t 



5 Hence, the changes do have an average frequency of 1/ X. 



Average error probability 
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Jo T T 

~{T-W){\~e ' )-A-^——e ' fl-e < I 



or, wrih « = ■ — 



A » r(a-l + «"*) 



From Ibis, the error probability can be deduced as: 
10 ton "^|«-l + 0 



Easier-lo-handlc bini nds tan be calculated by letting: jr( a <\ ra 5L JL~ + » - 1 4 e "* 
15 1 ' 6 2 



Its derivatives ajc: F* (a)n~--u4l~i ,HI 



P"(a)=)-e"* 

Since F(0) - F*(0) = F*XO) - 0, and F"(a) is positive for any a >= ft, F^a), P(a) and, 
ultimately. I : (U) ore as well. So: 
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There are better, «isy-to-find bounds for ibe case ot'u>=1 , but iter is no» an bilcresttog case 
(T-W >~ 1). Heoce. 
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Mnxlmni TTL to ensure a given error probability 

Au upper bound should be set on TTL which still ensures that the error probability is below a 
given ihreshold p». 



Par s pO (a -1 + «"*)£/>, 



T 2 r ' 
r 
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Therefore, an acceptable TTL is: 



Frmijii/oMJi 



Mwrimu l TTL to ensure ft given error rate 
10 ITiis is the same as for the enw piDbability, replacing y?i by tt 9 1 n is: 





> 


i 








ff 



















25 Therefore, nn acceptable TTL is; 



35 Average d elaj 

II is useful to consider the average delay /> between the enched copy and the original object 
in the case where the latter has changed during n given interval. Tirst, calculate <At(t/y> 
which is the average delay knowing that the change occurred at a time « between 0 and T. 

IT 
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From this, the overall delay <A/> can be tleduccd by integrating <An»> times the pmbability 
tbal a chnnge occurred between « rmd » » d fl . aver (0;r): 



-S'-"i(7j- J 7-"'^'"')) 



25 Since Ihc <Ar> calculation does not lake into account that there has been o tbongp within 
[OjJJ, it should be divided by the corresponding probability to get & 
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andsince: £L-5_ + a - l + c M iO 

6 2 



and: 

3 



Thau 



. 1 r'lfrV 



<_JL.I 



To order lo limit Z) to .Dp, a ffL should fc* chosen, ruch ay: 

T T 
6 2r 



T <A 
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Fiom Ihc nbwve. the following tables of Til, [T ($» versus a uunjlxr of enw probabilities, 
for various values of T aixl W can be derived; 





T = 30i 


&m 








OA 


. 15 


0 


33 . 


0.5 


17 


0.4 
""0.8 


■4.0 


1 


18 


4.3 


5 

10" 


23 


43 


62 


27 


7.8 


H2 



5 





W 15s 


T- 60s 






Oo-Ott 


DM 


0.1 


(6 


0.1 


3.1 


0.5 


IS 


. 0.4 


3.5 


1 


f9 


0.7 


3.8 


5 


27 


4.2 


5.8 


10 


35 


8-5 


8.2 



W ~ 15s, t — 600s 




LM 




J>(si 


O.J 


19 


0.1 


32 


0.5 


27 


0.4 


4.6 


1 


35 


0.9 


6.0 


5 


87 


4.8 


15.6 


10 


148 


9.3 


28.1 



?y-I5s l T« 3600s 




lis) 






0.1 


29 


0.1 


4.9 


0.5 


I 62 


0.5 


10.4 


1 


99 


10 


16.7 


5 


389 


4.8 


68.5 


10 


749 


9.3""" 


"139.3 



(55) 



JP 2004-504681 A 2004.2. 12 



«-nni»i/o 0651 







X-60s 
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From the above, the following conclusions can generally be made. 

15 The probability of the retrieved uifonnntioo being "stale* (older than W) is essentially zero 
for values of T(s) (Time to live) less than or equal to W and increases with increasing Tfs) 
according to a decaying exponential, approaching 100% probability of error at infinity. This 
observation that Uh; error probability is 7cro for values of T(s) less than or equal to W ii 
essentially a "trivial* result, since it is clear thtti no information can be older than W if it is 

15 updated more frequently than W. CJeruiy, however, the average update interval t has a 
significant dr«;t on bow steeply the error probability climbs ffnr values of T(s) greater than 
W. The j^ealer the average update interval t with respect to W. I be less sharply the error 
probability rises with increasing values of T(»> 

20 Figure 4 is a graph 400 illustrating tl>c evolution of error probability p m (vertical axis, 
expressed as a percentage (%)), as a function of the TTL (in seconds (s)) for various values of 
x (W -15s). The line 402 is for t - 20s, the line 404 is for x = 60s. the line 406 is for t = 
300», and the line 408 is for T - 600*. 

25 A graphical illustration of a technique for choosing o TIX.Io maintain error probability 
belw a threshold value j^is obtained by identifying the value of rjjon tl»c vertical axis of the 
graph 400 ond following an imaginary line horizontally across the graph 400 to where it 
intersects the curve (402,404,406, or 40S) for the appropriate value off. By way of example, 
if an error probability thrcsMd value of 10% is chosen, then the TTL for t =» 20s (line 402) 

so is a little untler 30 seconds, for x - 60s (line 404) it is about 35 seconds, for t - 3O0J (line 
406) it is about 85 seconds, and for -r » 600s (line 408) it is about J 50 seconds. 
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By deploying caches for static nnd dynamic content at appropriate locations to the Internet, 
and by selecting appropriate update idwracteristics for cached dynamic content as described 
hereinabove, effective user response times U> both dynamic and static content can be reduced 

• while simultaneously reducing congestion on the external network (i.e., the Internet). 

5 

The first time on end user (110) receives a web page, the web page may optionally plant a 
cookie in the user's browser, ihercafler, every time the end user .accesses the web page, the 
browser sends the cookie along with the access nqueM. The cookie lolls the server (104) 
what the server wants (or needs) to know about the end user, in addition to simply tbe fuel 
10 that the user wonts w retrieve the web page. r*or example, if Ibe web page is a perronafoed 
web page of n single end user, the server knows the end user's preferences. 

According to u feature of the invention, the cucbe (108) con look ut the cookie and see if the 
requested web page is a personalized pa ye of a single end user, or if the cookie indicates (he 
15 access request is coming from a global cixl user. If (he requested web page is a personalized 
web page for a angle end use/ (of if there is some other hicikatioo that the web page is 
supposed to be modified each time that it is accessed), tben foe web page should not be 
' cached. This can be accomplished by setting T1L = 0. Else (eg., glabal end user), TTL is 
si mp)y set as described hereinabove. 

20 

Although the invention has been described with respect to a limited number or embodiments, 
it will be appreciated that many variations, modifications aj>d other applications of the 
invention may be made, and are intended to be within the scope of the invention, 03 disclosed 
herein. 

25 
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CLAIMS 

What is claimed is: 

1 . System for caching objects laving dynamic content on the Internet, comprising: 

a cache connected in the Internet foi storing and updating copies of dynamic content. 

2. System, according to claim 1 , wlierein: 

(he cache is disposed at a location selected from the group consisting of major Tnternel 
switching locations, dial-in aggregation points, and corporate gateways. . 

3. Systen*, occording 10 claim J , wherein: 

the cache also stores and updates copies of static content. 

4. System, ticconlj ng i o cla im 1 , further comprising: 

means for monitoring one or more of the objects to determine update characteristics 
thereuf; and 

means lor adjusting a time tn live fTTL) parameter for (he objects based upon tjtc 
update characteristics. 

5. System, according to claim 4, further comprising: 
means for determining an age of on object; and 

means for updating the object if »he TTL for (he object is less than its age. 

0 System, according to claim 5, wherein: 

the Til for each object is calculated according to an average time of changes for the 

object. 

7. System, according to claim 4, further comprising: 

means for determining a probability of error for cich of the objects; and 
means for adjusting (he TTL fox each of the objects to maintain i ts probability of 
error below a predetermined erroT probability threshold. 

8. System, according to claim 7, wherein: 

the TI L for each object is calculated according to the equation: 




wherein: 

T Is ihe Time To Live, for the dynamio object; 

W is a given lime since the original object has changed (i.e., how long it is 
outdated); 

1 ts an average time between changes, which, is determined from the object's 
change statistics; and 

po is nwAimum error probability, which is the average ratio of llie number of 

requests fulfil Jed using o cached copy whoj* corresponding original abject has 
changed for ro^re lhan live given lime W, over the total number of rcqnc.Ms. 
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9- Syslero, according to claim 4, further corn|>rising: 

means for determining an error role for each of the objects; and 

means for adjusting Tft for each of Ihe objects to maintain its error rale below a 

predetermined error probability threshold. 

JO. System according to claim 9, wherein: 

Ihe TTL for each object is calculated according to the equation; 



wherein: 

T is (he Time To Live for the dynamic object; 

W is a given time since the original object has changed (i.c, bow long it w 
outdated); 

T is an average time between changes, which is determined from the object's 
charts statistics; 

rt is number of user requests per time unit (e.g., frequency); and 

no is maximum error rote, which is the average number per time rtnit of requests 

fulfilled using a cached copy whose corresponding original object has changed 

for more than the given time W. 

1 1 . System, according to claim 4, further com pi i sing: 

means for determining a delay time for each of tbo objects; and 

means for adjusting TfL for each of the objects to maintain its detoy lime below n 

piedetemiined detay threshold. 

12. System according to claim 1 1, wherein: 

the TTI. for each object is calculated according to the equation: 



wherein: 

T is the Time To Live for the dynamic object; 

* is an average time between changes, which is determined from the object's 
change statistics; and 

Do is rtiaximcm delay, which is the average time between an object change and 
when the cached copy is refreshed. 

13. System, according to claim 4, further comprising: 

means for determining at least one object characteristic selected from tbc group 
consisting of error probability, error rate »nd delay time for each of the objects; and 

means for adjusting TIL lo maintom ihe selected object chamcteztics below a 
respective threshold value. 




6Z> tf 
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1 4. System, according to claim 1 3, further comprising: 

means for limitiug adjustment of TTL for each of the objects to a range bounded by 
predLMcnni'ncd minimum (7min) nnd muximum (7'in.\)C) values for TI L. 

15. Method of responding to a user request for information Iiaving dynamic content* 
comprising; 

storing a copy of tbc dynamic content in a cache; 
establishing a time to live (TTL) for the dynamic content; 
receiving the user request at the cache; 

responding to the user request with the stored copy of the dynamic content if its TI L 
is greater than its age; and 

retrieving an updated copy of the dynamic content and responding to the user request 
with the updated copy if the TTL of thr stored copy is less than it* age. 

1 6. Method, recording to claim 1 5, further comprising: 
storing a copy of static content in the cache. 

1 7. ■ Method, according to claim 1 5, further comprising: 

determining an average update frequency for the dymunic content; and 
detcnuixang the TTL for the dynamic content ns a function of its average update 
frequency. 

1 8. Method, according to claim 1 S, further comprising: 

determining an average update frequency for the dynamic* content; and 
determining the TTL for the dynamic content as a function of its average update 
frequency and a predetermined error probability threshold. 

1 9. Method, according to claim 1 8, further comprising: 

adjusting the TTL for the djuunic content according to a frequency of user requests 
for the dynamic content. 

20. Method, according to claJm 1 5, wherein: 

the TTL for each object is calculated according to the equation: 



rtfOr+rj*}!*.!! 



wherein: 

T is the Time To Live, for the dynamic object: 

W is a {riven time since tbc original object has changed (i.e.. how long it is 
outdated); 

T is an average time between changes, which is detennined from the object's 
chonge statistics; and 

is maximum error probability, which is the average ratio of the number of 
requests fuljilled using a cached copy whose corresponding original object has 
changed fox more than tlx; given time W, over the total number of requests. 



21. 



Method, according to claim 15. wherein: 
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Ibe m for each otiject is calculated according to the equation: 



W + r- 



wbcrcin: 

T is (be Time To Live for Ihe dynamic object; 

W is n given time since the original object has changed (i.e., how Jong it is 
ootdiited); 

T is an average rime between changes, which is determined from the object's 
change statistics; 

n is number of user requests per time unit (e.g., frequency); and 
rto is maximum error mtc, which is (he average Dumber per lime unit of requests 
• fulfilled using o cached copy whose corresponding original object has changed 
tor more then the given time IV. 

Method, according In claim 1 5, wherein: 

(he TTL fw each objecl is calculated according to the equation: 



6 D 0 



wherem; 

T is the Time To Live for the dynamic object; 

t is an average time between changes, which is determined from the object's 

. change statistics; and 
Do is maximum delay, which is the average time between an object change and 
when the cached copy is refreshed. 

23. Method, according to claim 15, wherein the information is represented as a web page, 
(be method farther conipiising: 

a iirst time the user receives the web page, the wfcb j>age plants a cookie in the user's 
browser. 

24. Method, according to claim 23, wherein: 

every subsequent time tho end user requests the web page, the browser sends the 

cookie. 



25. Method, according to claim 24, further comprising: 

the cache looks at the cookie aud sec if the requested web page is a personalized page 
of a single end user. 

26. Method, according to claim 23. further comprising: 
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if the requested web page is a personalized web page for a single end user, then the 
web page is not cached 

27. Method, recording to claim 1 5. further comprising: 

if the information 19 supposed 10 be modified each time it is accessed, selling TTL - 

0. 



28. Method of responding to a user request for un objeet having dynamic content, unid 
object originating from a server, comprising: 

storing a copy of the object irt a cache; 
establishing a time to live (TTL) for the object; 
receiving the user request at the cache; 

fulfilling the user request with the siored copy of Ihe object if its TTL is greater than 
its Qge; and 

fetching an updated copy of the object end responding to the user request With (be 
updated copy if the TTL of the siored copy is less than its age. 

29. Method, according to claim 26, further comprising, in the cache: 

first setting the TIL for the object 10 a reasonable lower limit (Tmin); and 
adjusting die TTL for the object based on the frequency at which the object actually 
changes. 

30. Method, according to claim It. further comprising'. 

each time the cache frtcbes the object from the server, performing the following 

procedures: 

a. if another fetch for the .wmc object is ongoing, waiting for the previous fetch 10 
complete; 

b. fetching the object from Ihe server, 

c replacing the cached copy, if present, by the fetched object, aflcr having compared 
them to rfetennjne whether the object had changed since it was last fetched; 
d. initializing or updating the object's change statistics accordingly; 

c. marking the object as static or dynamic content depending on the server's reply; ond 
f. if the object is dynamic, setting its TTL (7) to an appropriate value with respect to 

an overage time between changes (i) determined from the object's change statistics, the 
number of user requests per time unit (n) determined from the objecis access statistics, and 
one of the following procedures (A -E): 

A. maximum error probability; 

IS. maximum enor rate; 
' C- uMuirnuro delay, 

D. nny combination of the above (A.B,C), taking the lowest T; 01 

E. any combination of the above (A,B,C), but keeping T within a predetermined 
window of "reasonable" vahies bounded by Train and Tmax. 

31 . Method, accojdbg to claim 28, wherein the information is represented as a web page, 
the method further comprising: 

a first time the user receives the web paRe, (he web page plants a cookie in the user's 
browsei. 



32. Method, accoiding to clsun 3 1 , wherein: 
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r-ny eubccqit&ni time the end user requests the web page, the browser sends the 

coolie. 

33. Method* nccording to claim 32, rurtljer comprising: 

the cache looks at tbo cookie mid sec if tlw requested web page is n personalized page 
of n stogie end user. 

34. Method, according to claim 3 1 , ftirthti wmprinng: 

if the requeued web page is a personalized web page for o single end user, then ibe 
web page is not cached. 

35. Method, according to claim 2«, further comprising: 

if the information is supposed to he modified each lime it is accessed, selling 'ITL « 0. 
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